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The Biotechnology Systems Branch of the Scientific and Technical Information 
Center (STIC) detected errors when processing the following computer readable 
form: 

Application Serial Number: 

Source: /^l/Z/O ~ 

Date Processed by STIC: jg 1^/2001- 

THE ATTACHED PRINTOUT EXPLAINS DETECTED ERRORS. 
PLEASE FORWARD THIS INFORMATION TO THE APPLICANT BY EITHER: 

1) INCLUDING A COPY OF THIS PRINTOUT IN YOUR NEXT COMMUNICATION TO THE 
APPLICANT, WITH A NOTICE TO COMPLY or, 

2) TELEPHONING APPLICANT AND FAXING A COPY OF THIS PRINTOUT, WITH A ; 
. NOTICE TO COMPLY 

FOR CRF SUBMISSION QUESTIONS, PLEASE CONTACT MARK SPENCER, 703-308-4212. 

FOR SEQUENCE RULES INTERPRETATION, PLEASE CONTACT ROBERT WAX, 703-308-4216. 
PATENTIN 2.1 e-mail help: patin21help@uspto.gov or phone 703-306-4119 (R. Wax) 
PATENTIN 3.0 e-mail help: patin3help@uspto.gov or phone 703-306-41 19 (R. Wax) 

TO REDUCE ERRORED SEQUENCE LISTINGS, PLEASE USE THE CHECKER 
VERSION 3.1 PROGRAM , ACCESSIBLE THROUGH THE U.S. PATENT AND 
TRADEMARK OFFICE WEBSITE. SEE BELOW FOR ADDRESS: 
http://www.uspto.gov/web/offices/pac/checker 

Applicants submitting genetic sequence information electronically on diskette or CD-Rom should be aware that there 

a possibility that the disk/CD-Rom may have been affected by treatment given to all incoming mail. 

Please consider using alternate methods of submission for the disk/CD-Rom or replacement disk/CD-Rom. 

Any reply including a sequence listing in electronic form should NOT be sent to the 20231 zip code address for the 

United States Patent and Trademark Office, and instead should be sent via the following to the indicated addresses: 

1. EFS-Bio (<http://www.uspto.gov/ebc/efs/downloads/documents.htm> , EFS Submission 
User Manual - ePA VE) 

2. U.S. Postal Service: U.S. Patent and Trademark Office, Box Sequence, P.O. Box 2327, Arlington, VA 22202 

3. Hand Carry directly to: 

U.S. Patent and Trademark Office, Technology Center 1600, Reception Area, 7 th Floor, Examiner Name, 
Sequence Information, Crystal Mall One, 191 1 South Clark Street, Arlington, VA 22202 
Or 

U.S. Patent and Trademark Office, Box Sequence, Customer Window, Lobby, Room 1B03, Crystal Plaza Two, 
201 1 South Clark Place, Arlington, VA 22202 

4. Federal Express, United Parcel Service, or other delivery service to: U.S. Patent and Trademark Office, 
Box Sequence, Room 1B03-Mailroom, Crystal Plaza Two, 201 1 South Clark Place, Arlington, VA 22202 
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RAW SEQUENCE LISTING DATE: 06/04/2002 ^ 

PATENT APPLICATION: US/10/048 , 14 6A TIME: 16:18:15 ^ < 



Input Set : A:\62068.app 

Output Set: N:\CRF3\06042002\J048146A.raw 



te® Hox Comnipty 
GorRgeted Diskette Graded 



3 <110> APPLICANT: Tsang et al . 

5 <120> TITLE OF INVENTION: METHODS AND COMPOSITIONS FOR DETECTING LARVAL TAENIA SOLIUM 
7 <130> FILE REFERENCE: 6395-62068 
9 <140> CURRENT APPLICATION NUMBER: 10/048, 146A 
10 <141> CURRENT FILING DATE: 2002-04-18 

12 <150> PRIOR APPLICATION NUMBER: US 60/147,318 

13 <151> PRIOR FILING DATE: 1999-08-03 

15 <150> PRIOR APPLICATION NUMBER: PCT/US00/21173 

16 <151> PRIOR FILING DATE: 2000-08-03 
18 <160> NUMBER OF SEQ ID NOS : 9 

20 <170> SOFTWARE: Patentln version 3.1 

22 <210> SEQ ID NO: 1 

23 <211> LENGTH: 2153 

24 <212> TYPE: DNA 

25 <213> ORGANISM: Taenia solium 

27 <220> FEATURE: 

28 <221> NAME/KEY: CDS 

29 <222> LOCATION: (145).. (531) 

30 <223> OTHER INFORMATION: 

33 <400> SEQUENCE: 1 

34 ctgcagtgaa gttgacaagt agttgaccat ttacggaaca tcaatggagg acactttggt 60 
36 agggaaagca tacgataaac ataaaccaat gctggttata taagagacga tctcggctac 120 

38 acttgtaact gaacaacctg taga atg cgt gcc tac att gtg ctt etc get 171 

39 Met Arg Ala Tyr lie Val Leu Leu Ala 

40 15 



42 


etc 


act 


gtt 


ttc 


gta 


gtg 


acg 


gtg 


teg 


gcc 


gag 


tgg 


gtg 


ccc 


att 


teg 


219 


43 


Leu 


Thr 


Val 


Phe 


Val 


Val 


Thr 


Val 


Ser 


Ala 


Glu 


Trp 


Val 


Pro 


He 


Ser 




44 


10 










15 










20 










25 




46 


agt 


gtc 


cac 


ata 


gcc 


tea 


tgc 


aaa 


age 


cac 


tac 


atg 


ttc 


caa 


tta 


aaa 


267 


47 


Ser 


Val 


His 


He 


Ala 


Ser 


Cys 


Lys 


Ser 


His 


Tyr 


Met 


Phe 


Gin 


Leu 


Lys 




48 










30 










35 










40 






50 


cgc 


ttt 


ttt 


gcc 


ttt 


agg 


aaa 


aac 


aaa 


ccg 


aaa 


gat 


gtt 


gca 


aat 


agt 


315 


51 


Arg 


Phe 


Phe 


Ala 


Phe 


Arg 


Lys 


Asn 


Lys 


Pro 


Lys 


Asp 


Val 


Ala 


Asn 


Ser 




52 








45 










50 










55 








54 


acg 


aaa 


aaa 


ggg 


ata 


gaa 


tat gtc cac gaa ttc 


ttc 


cac. 


gaa 


gac 


ccg 


363 


55 


Thr 


Lys 


Lys 


Gly 


He 


Glu 


Tyr Val His Glu Phe 


Phe 


His 


Glu 


Asp 


Pro 




56 






60 










65 










70 










58 


att 


ggt 


aaa 


caa 


att 


get 


caa 


etc 


gca 


aag 


gaa 


tgg 


aag 


gaa 


gca 


atg 


411 


59 


He Gly 


Lys 


Gin 


He 


Ala 


Gin 


Leu 


Ala 


Lys 


Glu 


Trp 


Lys 


Glu 


Ala 


Met 




60 




75 










80 










85 












62 


ttg 


gaa 


ggt 


agg 


ttt 


tgg 


tgt 


ttt 


ctg 


tea 


gaa 


gaa 


aat 


tat 


eta 


ttc 


459 


63 


Leu 


Glu 


Gly Arg 


Phe 


Trp 


Cys 


Phe 


Leu 


Ser 


Glu 


Glu 


Asn 


Tyr 


Leu 


Phe 




64 


90 










95 










100 










105 
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RAW SEQUENCE LISTING DATE: 06/04/2002 

PATENT APPLICATION: US/10/048 f 14 6A TIME: 16:18:15 

Input Set : A:\62068.app 

Output Set: N:\CRF3\06042002\J048146A.raw 

66 att cat eta gac aaa ggc aaa ata egg acg tea ctg gtt gag cac tgc 507 

67 lie His Leu Asp Lys Gly Lys lie Arg Thr Ser Leu Val Glu His Cys 

68 110 115 120 

70 aaa ggt cct aag aaa aaa act get taacttgtca actttcatgc gttcttctct 561 

71 Lys Gly Pro Lys Lys Lys Thr Ala 

72 ' 125 

74 tcactaataa atgetcatta ataagaaagc tgccttttgc aagatcaacg agggecatag 621 

76 actgtgaggg ttatagecta aggttatggg gtgaaatgag ataggaattg agcatttgag 681 

78 aagttactaa tttaaattga aagcegcatt tettctgeaa ttgacgtgtg atggttagcg 741 

80 aaaccaagtg aagcacgacc tcttgagtcg tttcaacagc cgccagtggt ttcaccagtg 801 

82 gcttcaccag. tgggtagact ggtttgtcac acatgegagg tacggtcaga gggctaacag 861 

84 gtgtggtgga ggggecaaca cgtgtaagac aagcagttcc ccttctctgt cgtgaggcac 921 

86 actcagcacc cacctcgttt acttctccct tgacgactgt aatgcatttg gggtcaccat 981 

88 gcccccgcca agttgaaggc actgatgaca tttgtaccat atcaccgata agtattaact 1041 

90 cttccacttc ccagattttg aggtcaggcg atcctactga ctcggtgtag ccccatggtg 1101 

92 gtccatgctc tgcaccattc gctgttcagt ggagcatcca cctagacggc caaccaatct 1161 

94 cgcctccctt ctcctgtgct caagatgtgc gtcggtgaga tttggagggt ctgatcacca 1221 

96 tactaaccac gtaggtttca tcatctctaa gaagcaccac ttcttgaggt cgcattgtgt 1281 

98 accaccagcc ggtgtaatca agagtgactt tcgcgtcacc cctaagaagg ctatagatct 1341 

100 gcaagtcagc gcaatagctt cagccatgct gactaaaatg tgtaagggac cagtagctct 1401 

102 agcccaacac aagtggagct aataatgggc ttccccagat acatgaatcc caaateggtg 1461 

104 ageatgggee atgaatatgg ccttctgagt cttccttgaa tgcaaacgaa ggcatagcac 1521 

106 gagggtagga tgagtgtaca gaaaacagcg aggcaacgaa tctactggca tggecctgat 1581 

108 gccaccccgc ccagctaggg tagtttggcc acctcagtcc ttaatcgaat geggcagtea 1641 

110 gaacaaacaa agtattacat agccacactc ttcttttgag cgtcgtcctc gacgctcctt 1701 

112 tcgacacacc tcccgcatca gccaccacaa agtaatcagt actggggaga cacccacgag 1761 

114 ctaaccgtgc cagtcatgga aaatttgacg gcaactgagg agatgectga ccccctttgg 1821 

116 cagttcgaat gctgcccgtg gtcaaactcc tgcatcagcc atcacctacg attcaaacat 1881 

118 cctagtcgcc aaattttcgt gaaccctcta aaattttcgt gcactctcaa gacacttcca 1941 

120 actgacttag agctttttca tttggtgaga acaegtaaaa gcttcaagta aacaacaggc 2001 

122 aacgatttca etttgatget ctcaccatca attctcttgt atgtgccacc accttaaacc 2061 

124 ctccctgacc acttccactc tctctctctc cctaaataac aacacttgga agcatgaatg 2121 

126 gtgtctgtca aagttacacc cctagactgc ag 2153 

129 <210> SEQ ID NO: 2 

130 <211> LENGTH: 129 

131 <212> TYPE: PRT 

132 <213> ORGANISM: Taenia solium 
134 <400> SEQUENCE: 2 



136 


Met 


Arg 


Ala 


Tyr 


He 


Val 


Leu 


Leu 


Ala 


Leu 


Thr 


Val Phe 


Val 


Val 


Thr 


137 


1 






5 










10 








15 




140 


Val 


Ser 


Ala 


Glu 


Trp 


Val 


Pro 


He 


Ser 


Ser 


Val 


His He 


Ala 


Ser 


Cys 


141 








20 










25 








30 






144 


Lys 


Ser 


His 


Tyr 


Met 


Phe 


Gin 


Leu 


Lys 


Arg 


Phe 


Phe Ala 


Phe 


Arg 


Lys 


145 






35 










40 








45 








148 


Asn 


Lys 


Pro 


Lys 


Asp 


Val 


Ala 


Asn 


Ser 


Thr 


Lys 


Lys Gly 


lie 


Glu 


Tyr 


149 




50 










55 










60 








152 


Val 


His 


Glu 


Phe 


Phe 


His 


Glu 


Asp 


Pro 


He 


Gly 


Lys Gin 


He 


Ala 


Gin 


153 


65 










70 










75 








80 


156 


Leu 


Ala 


Lys 


Glu 


Trp 


Lys 


Glu 


Ala 


Met 


Leu 


Glu 


Gly Arg 


Phe 


Trp 


Cys 
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PATENT APPLICATION: US/10/048 , 14 6A TIME: 16:18:15 

Input Set : A:\62068.app 

Output Set: N:\CRF3\06042002\J048146A.raw 

157 85 90 95 

160 Phe Leu Ser Glu Glu Asn Tyr Leu Phe lie His Leu Asp Lys Gly Lys 

161 100 105 . 110 

164 He Arg Thr Ser Leu Val Glu His Cys Lys Gly Pro Lys Lys Lys Thr 

165 115 120 125 
168 Ala 

172 <210> SEQ ID NO: 3 

173 <211> LENGTH: 298 

174 <212> TYPE: DNA 

175 <213> ORGANISM: Taenia solium 

177 <220> FEATURE: 

178 <221> NAME/KEY: CDS 

179 <222> LOCATION: (3).. (224) 

180 <223> OTHER INFORMATION: 

183 <400> SEQUENCE: 3 

184 ta ttc gta gtg gcg gtt teg gec gag aaa aac aaa ccg aag tgt gat 47 

185 Phe Val Val Ala Val Ser Ala Glu Lys Asn Lys Pro Lys Cys Asp 

186 15 10 15 

188 gca aat agt act aag aaa gag ata gaa tat ate cac aat tgg ttt ttc 95 

189 Ala Asn Ser Thr Lys Lys Glu He Glu Tyr He His Asn Trp Phe Phe 

190 20 25 30 

192 cat gat gac ccg att gga aaa caa att get caa etc gca aag gac tgg 143 

193 His Asp Asp Pro He Gly Lys Gin He Ala Gin Leu Ala Lys Asp Trp 

194 35 40 45 

196 aat gaa aca gtg cag gaa gee aaa ggc aaa ttt tgg gcg tea ctg get 191 

197 Asn Glu Thr Val Gin Glu Ala Lys Gly Lys Phe Trp Ala Ser Leu Ala 

198 50 55 60 

200 gag tac tgc aga ggt ctg aag aac aaa act get taacttgtca actttcatgc 244 

201 Glu Tyr Cys Arg Gly Leu Lys Asn Lys Thr Ala 

202 65 70 ( 

204 gttcttctct tcaccaataa atgetgatta acaagaaaaa aaaaaaaaaa aaaa 298 

207 <210> SEQ ID NO: 4 

208 <211> LENGTH: 74 

209 <212> TYPE: PRT 

210 <213> ORGANISM: Taenia solium 
212 <400> SEQUENCE: 4 



214 


Phe 


Val Val Ala 


Val 


Ser 


Ala 


Glu 


Lys 


Asn 


Lys 


Pro 


Lys 


Cys 


Asp 


Ala 


215 


1 




5 










10 










15 




218 


Asn 


Ser Thr Lys 


Lys 


Glu 


He 


Glu 


Tyr 


He 


His 


Asn 


Trp 


Phe 


Phe 


His 


219 




20 










25 










30 






222 


Asp 


Asp Pro He 


Gly 


Lys 


Gin 


He 


Ala 


Gin 


Leu 


Ala 


Lys 


Asp 


Trp 


Asn 


223 




35 








40 










45 








226 


Glu 


Thr Val Gin 


Glu 


Ala 


Lys 


Gly 


Lys 


Phe 


Trp 


Ala 


Ser 


Leu 


Ala 


Glu 


227 




50 






55 










60 










230 


Tyr 


Cys Arg Gly 


Leu 


Lys 


Asn 


Lys 


Thr 


Ala 














231 


65 






70 






















234 


<210> SEQ ID NO: 


: 5 

























235 <211> LENGTH: 294 

236 <212> TYPE: DNA 
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RAW SEQUENCE LISTING DATE: 06/04/2002 

PATENT APPLICATION: US/10/04 8 , 14 6A TIME: 16:18:15 

Input Set : A:\62068.app 

Output Set: N:\CRF3\06042002\J048146A.raw 

237 <213> ORGANISM: Taenia solium 

239 <220> FEATURE: 

240 <221> NAME/KEY: CDS 

241 <222> LOCATION: (3).. (221) 

242 <223> OTHER INFORMATION: 

245 <400> SEQUENCE: 5 

246 tt ttc gta gtg gcg gtg teg gec gag gaa act aaa cca gag gac gtg 47 

247 Phe Val Val Ala Val Ser Ala Glu Glu Thr Lys Pro Glu Asp Val 

248 15 10 15 

250 gta aag aat att aag aaa ggg atg gaa gtt gtc tac aaa ttt ttc tac 95 

251 Val Lys Asn He Lys Lys Gly Met Glu Val Val Tyr Lys Phe Phe Tyr 

252 20 25 30 

254 gaa gac ccg ttg gga aag aaa ata get caa etc gca aag gac tgg aag 143 

255 Glu Asp Pro Leu Gly Lys Lys He Ala Gin Leu Ala Lys Asp Trp Lys 

256 35 40 45 

258 gaa gca atg ttg gaa gec aga age aaa gtg egg gcg tea ctg get gag 191 

259 Glu Ala Met Leu Glu Ala Arg Ser Lys Val Arg Ala Ser Leu Ala Glu 

260 50 55 60 

262 tac ate aga ggt etc aag aac gaa get get taacttgtca actttcatgc 241 

263 Tyr He Arg Gly Leu Lys Asn Glu Ala Ala 

264 " 65 70 

266 gttcttctct tcactaataa atgetcatta ataagaaaaa aaaaaaaaaa aaa 294 

269 <210> SEQ ID NO: 6 

270 <211> LENGTH: 73 

271 <212> TYPE: PRT 

272 <213> ORGANISM: Taenia solium 
274 <400> SEQUENCE: 6 

276 Phe Val Val Ala Val Ser Ala Glu Glu Thr Lys Pro Glu Asp Val Val 

277 1 5 10 15 

280 Lys Asn He Lys Lys Gly Met. Glu Val Val Tyr Lys Phe Phe Tyr Glu 

281 20 25 30 

284 Asp Pro Leu Gly Lys Lys He Ala Gin Leu Ala Lys Asp Trp Lys Glu 

285 35 40 45 

288 Ala Met Leu Glu Ala Arg Ser Lys Val Arg Ala Ser Leu Ala Glu Tyr 

289 50 55 60 

292 He Arg Gly Leu Lys Asn Glu Ala Ala 

293 65 70 

296 <210> SEQ ID NO : 7 

297 <211> LENGTH: 6 

298 <212> TYPE: PRT 

299 <213> ORGANISM: Taenia solium 
301 <400> SEQUENCE: 7 

303 He Ala Gin Leu Ala Lys 

304 1 5 

307 <210> SEQ ID NO: 8 

308 <211> LENGTH: 24 

309 <212> TYPE: PRT 

310 <213> ORGANISM: Taenia solium 
312 <220> FEATURE: 
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313 
314 
315 
318 
319 
320 
321 
324 
325 
326 
327 
330 
331 
332 
333 
336 
337 
338 
339 
342 
343 
344 
345 
348 
350 
351 



359 
360 
361 
363 
364 
365 
366 
369 
370 
371 
372 
375 
376 
377 
378 
381 
382 
383 
384 
387 



RAW SEQUENCE LISTING 

PATENT APPLICATION: US/10/04 8 f 14 6A 



DATE: 06/04/2002 
TIME: 16:18:15 



Input Set : 
Output Set: 



A:\62068.app 
N:\CRF3\06042002\J04 



<221> 
<222> 
<223> 
<220> 
<221> 
<222> 
<223> 
<220> 
<221> 
<222> 
<223> 
<220> 
<221> 
<222> 
<223> 
<220> 
<221> 
<222> 
<223> 
<220> 
<221> 
<222> 
<223> 
<400> 




acid at position 7 may 



NAME/KEY: variant 
LOCATION: (7) . . (8) 
OTHER INFORMATION^ 
FEATURE : 
NAME/KEY: site 
LOCATION: (21) . . (22) 
OTHER 
FEATURE : 

NAME/KEY: variant 
LOCATION: (14)..( 

OTHER INFORMATION^Amino acid at position 

FEATURE : C__ 

NAME/KEY: variant 

LOCATION : ( 18 ) . . ( 19 : 

OTHER INFORMATION :/AmTno acid at position 

FEATURE : ^ — 

NAME/KEY: variant 




ON: (21).. (22) , -^^JT^a^C 

INFORMATION: Asparagine at position 21 is an amino acid insertion yOr\-AJ~c~ 



14 may also be 




18 may also be valTriS 



LOCATION: (19) . . (20) 
OTHER INFORMATION: 
FEATURE : 

NAME/KEY: variant 
LOCATION: (20) . . (21) 
OTHER INFORMATION 
SEQUENCE : 8 
Lys Asn Lys Pro Lys Asp ("Asp 

i /^V^V"^ 5 





acid at position 20 may also be arginin^//<>^^^ 



Ala Ala Ser Thr Lys Lys 
10 

354 Tyr [lle|Trpf Hisy/Asn Phe Phe Phe 

358 <210> SEQ ID NO: 9 
LENGTH: 13 
TYPE: PRT 

Taenia solium 




Glu 



le Glu 
15 ' * 



<211> 
<212> 
<213> 
<220> 
<221> 
<222> 
<223> 
<220> 
<221> 
<222> 
<223> 
<220> 
<221> 
<222> 
<223> 
<220> 
<221> 
<222> 
<223> 
<400> 



variant 
(5) . . (6) 



ORGANISM: 
FEATURE : 
NAME/KEY : 
LOCATION: 
OTHER INFORMATION 
FEATURE: 

NAME/KEY: variant 
LOCATION: (12) . . (13) 
OTHER INFORMATION 
FEATURE : 

NAME/KEY: variant 
LOCATION: (7) . . (9) 
OTHER INFORMATIO: 
FEATURE 

NAME/KEY: site 
LOCATION: (8) . . (9) 
OTHER INFORMATION: 
SEQUENCE: 9 




Amino acid at position 7 may also be asparagine 





Tryptophan at position 8 is an amino acid insertion 
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VERIFICATION SUMMARY DATE: 06/04/2002 

PATENT APPLICATION: US/10/048 , 146A TIME: 16:18:16 

Input Set : A:\62068.app 

Output Set: N:\CRF3\06042002\J048146A.raw 

L:10 M:271 C: Current Filing Date differs, Replaced Current Filing Date 
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